Designating a partition column

To designate a partition column, do the following:

  1. Navigate to the page of the data source for which you want to generate the metadata catalog using AI.

  2. Go to the Tables tab and click the link of the specific table you wish to catalog.

  3. On the Table Info page, click the Column info tab.

  4. At the far right of the column's row, click the more options icon and select the Make partition column option. This option is available only for columns of the following data types: int, bigint, timestamp, long, and date.

  5. In the Partition Column Settings box, select the Enable checkbox. You can also click Advance Settings and specify the partition size.

    The column designated as the partition column is marked with a tick in the column list.

    Apache Spark uses the designated column as the partition column when processing datasets. Only one column can be designated as the partition column. The default partition size is 50K.

    Note: This setting is available only for RDBMS data sources.

    Warning: If you attempt to change the partition column, a warning is displayed because the change can affect pipeline and retrieval efficiency.

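To make the effect of the partition column and partition size more concrete, the following Python sketch shows one way a numeric partition column could be split into ranges. It is an illustration only, not the product's implementation: the function names, the `id` column, and the reading of "partition size" as the number of consecutive column values per partition are all assumptions. Only the list of eligible data types and the 50K default come from the steps above.

```python
# Data types for which the Make partition column option is offered (from step 4).
ELIGIBLE_TYPES = {"int", "bigint", "timestamp", "long", "date"}

# Documented default partition size (50K). Treating it as a count of
# consecutive column values per partition is an assumption of this sketch.
DEFAULT_PARTITION_SIZE = 50_000


def is_partition_eligible(data_type: str) -> bool:
    """Return True if a column of this data type can be made the partition column."""
    return data_type.strip().lower() in ELIGIBLE_TYPES


def partition_ranges(min_value: int, max_value: int,
                     partition_size: int = DEFAULT_PARTITION_SIZE):
    """Split [min_value, max_value] into inclusive ranges of at most partition_size values.

    Hypothetical helper: it only illustrates range partitioning on a numeric column.
    """
    if partition_size <= 0:
        raise ValueError("partition_size must be positive")
    ranges = []
    start = min_value
    while start <= max_value:
        end = min(start + partition_size - 1, max_value)
        ranges.append((start, end))
        start = end + 1
    return ranges


if __name__ == "__main__":
    print(is_partition_eligible("BIGINT"))   # eligible type, case-insensitive
    print(is_partition_eligible("varchar"))  # not in the eligible list
    # "id" is a hypothetical bigint column chosen as the partition column.
    for low, high in partition_ranges(1, 120_000):
        print(f"WHERE id BETWEEN {low} AND {high}")
```

Under these assumptions, each range can be read independently, which is why changing the partition column later can affect pipeline and retrieval efficiency.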